Extending CLIP for Category-to-Image Retrieval in E-Commerce

نویسندگان

چکیده

E-commerce provides rich multimodal data that is barely leveraged in practice. One aspect of this a category tree being used search and recommendation. However, practice, during user’s session there often mismatch between textual visual representation given category. Motivated by the problem, we introduce task category-to-image retrieval e-commerce propose model for task, CLIP-ITA. The leverages information from multiple modalities (textual, visual, attribute modality) to create product representations. We explore how adding impacts model’s performance. In particular, observe CLIP-ITA significantly outperforms comparable only modality modality.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Deep Color Semantics for E-commerce Content-based Image Retrieval

This paper aims to develop a methodology to retrieve images based on fuzzy dominant colors expressed through linguistic descriptions. This process involves two steps: assigning fuzzy colorimetric profile to the image and processing the user query. People regard color as an aesthetic issue, especially when it comes to choosing the colors for their clothing, apartment design and other objects aro...

متن کامل

Category suggestion for e-commerce queries

This thesis proposes a category suggestion model for an e-commerce search engine to get an insight on the intent of the visitors. The dataset is provided by the Dutch e-commerce website Bol.com and consists of visitors queries and their subsequent clicks on items. The first part of the model is clustering the dataset to decrease the complexity and find previously unseen similarities between que...

متن کامل

Category-based image retrieval

This work presents a novel approach to content-based image retrieval in categorical multimedia databases. The images are indexed using a combination of text and content descriptors. The categories are viewed as semantic clusters of images and are used to confine the search space.

متن کامل

Extending SAR Image Despckling methods for ViSAR Denoising

Synthetic Aperture Radar (SAR) is widely used in different weather conditions for various applications such as mapping, remote sensing, urban, civil and military monitoring. Recently, a new radar sensor called Video SAR (ViSAR) has been developed to capture sequential frames from moving objects for environmental monitoring applications. Same as SAR images, the major problem of ViSAR is the pres...

متن کامل

Commerce Image Retrieval with Combination of Descriptors

It is a critical problem to find the commerce quickly and accuracy. Information from different sources can improve the retrieval performance. In this paper, we proposed a combination algorithm of color descriptor, LBP texture descriptor and HOG shape descriptor for commerce image retrieval. The commerce image retrieval experiments on commerce image dataset PI 100 indicate the combination can bo...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Lecture Notes in Computer Science

سال: 2022

ISSN: ['1611-3349', '0302-9743']

DOI: https://doi.org/10.1007/978-3-030-99736-6_20